Physica A: Statistical Mechanics and its Applications — Latest Matching Preprints

1

Critical Scaling Laws and Universality Classes in Biomolecular Condensates

Song, H.; Hu, G.; Wu, X.; Zhang, X.; Li, J.

2026-06-29 biophysics 10.64898/2026.06.24.734243 medRxiv

Top 0.1%

4.8%

Show abstract

Biomolecular condensates are widespread cellular self-assembled structures with essential functions. There are suggestions of condensates formed by different proteins being near criticality. However, systematic investigation of the criticality of condensates is absent, and critical exponents defining their universality class have not been found. Here, using long-time simulations, we show that condensates exhibit typical critical phenomena, including scale-free spatiotemporal correlations, critical slowing down, divergence of correlation length and dynamic scaling. From these scaling behaviors, a set of critical exponents is determined. Based on dynamic critical exponent, diverse condensates can be divided into two distinct universality classes, arising from differences in their molecular components and interaction types.

2

Modeling the Effectiveness of Antibiotic Therapies Against Sepsis Using Continuous-time Hidden Markov Models

Schmiegel, S.; Marchi, H.; Borgstedt, R.; Rehberg, S.; Fuchs, C.; Mews, S.

2026-07-10 health informatics 10.64898/2026.07.03.26357092 medRxiv

Top 0.1%

2.7%

Show abstract

Patients suffering from sepsis need to be treated with an effective antibiotic therapy within the first hour after sepsis onset to decrease their risk of death. Microbiological data that provide information about the suitability of antibiotic therapies, however, is usually available only after 72 hours. Consequently, the treating physicians need to judge a therapy's effectiveness based on the patients' measured health records and their general health condition. This medical assessment is complex and requires years of experience. In our study, we investigate how statistical modeling can contribute to assessing the effectiveness of antibiotic therapies. To that purpose, we describe the effectiveness of antibiotic therapies by modeling sepsis patients' health conditions using a three-state continuous-time hidden Markov model (ctHMM). In literature, procalcitonin (PCT) and lactate have proven to be helpful for deriving the health condition in this context. The state probabilities obtained by the ctHMM are subsequently used to quantify the effectiveness of antibiotic therapies. To this end, we apply two different approaches, namely (i) averaging of the state probabilities and (ii) a logistic regression model. For (i), we calculate the average of the state probabilities for the state indicating a sepsis-free condition over an antibiotic administration period of 48 hours. For (ii), we use the information about antibiotic susceptibility testings as dependent variable in the logistic regression model; as independent variables, we calculate the difference between state probabilities at the start of antibiotic administration and 48 hours later. With this work, we are able to better understand the relationship between laboratory values, in particular PCT and lactate, and the patients' health condition. We further provide approaches for quantifying the effectiveness. Therefore, our work contributes to developing a clinical decision support system which helps physicians assess the effectiveness of antibiotic therapies in patients with sepsis. Supported by such a system, a physician is able to quickly adjust an ineffective therapy which avoids antibiotic resistances and increases a patient's chance to survive a sepsis.

3

Graph Neural Networks (GNNs) for Protein-Ligand Interaction Prediction

Khilar, S.; Natarajan, E.

2026-04-24 bioinformatics 10.64898/2026.04.23.720519 medRxiv

Top 0.1%

2.1%

Show abstract

Predicting protein-ligand interactions in the modern drug discovery has revolved from the involvement of artificial intelligence and structural bioinformatics using Graph Neural Networks (GNNs). The limited explainability of GNN models presents an important encumbrance in biomedical research, but it has achieved a high degree of accuracy in determining and identifying binding affinity and active compounds, as evidenced by [1] [2] [3] [4]. Here this research focuses on the interpretation of protein-ligand interactions at a molecular level, a rapidly developing area within Graph Neural Networks (GNNs). Now days modern study handling techniques such as visualization techniques, attention mechanism and model-based feature ascription by model to boost, and make robust and decrease false predictions on binding. Along with some approaches include like graph pooling strategies, message-passing optimization, self-supervised learning, transfer learning and contrastive learning are rapidly utilized to enhance the representative learnings. Furthermore, integration of molecular docking simulations, hybrid deep learning architectures and protein language model gives more reliable & biological predictions of protein-ligand interactions. That focuses on given process that identifies key ligand atoms and binding residues, as well as physicochemical factors influencing affinity, through chemical thought processes. Here this research work identified the challenges of developing biologically significant explanations, transparency, and the corollary dataset biases on interpretability. The research work conducted an in-depth investigation into the consolidation of protein language models to establish more reliable pathways for future research, examining hybrid architectures, transparent and energy-efficient GNNs, and scientifically grounded AI models for drug discovery. My research work highlights that XGNNs establishes a connection between Deep Learning and Biochemical expertise with increased confidence, which will enhance the accuracy of predictive models and computational models.

4

Enhancing dengue diagnosis and surveillance by integrating machine learning technologies with the NS1 rapid test kit

Hwang, C.-K.; Chen, Y.-W.; WANG, Y.-T.; Ho, T.-S.; Oyang, Y.-J.

2026-05-06 health informatics 10.64898/2026.05.05.26352445 medRxiv

Top 0.1%

1.7%

Show abstract

BackgroundDengue has been a major health threat globally in recent years. In particular, dengue incidences continue to increase annually and the epidemic area has expanded primarily due to global warming. Therefore, effective case detection and surveillance strategies are crucial to tackle this global health challenge. In clinical practice, the rapid test kit detecting dengue non-structural protein 1 antigen and commonly referred as NS1, is widely employed for early diagnosis. However, real-world studies revealed that the sensitivity of the NS1 test kit ranged from approximately 61% to 95%. Since early diagnosis is really critical for disease surveillance in the early stage of a dengue epidemic, scientists have been working hard to develop novel diagnosis methods that can provide higher sensitivity levels. Methodology/Principal FindingsIn response to this challenge, in this study, we have developed a novel diagnosis procedure that integrates machine learning technologies with the NS1 test kit. Our experimental results revealed that we would be able to raise the sensitivity of the dengue diagnosis procedure to higher than 99% by incorporating machine learning based prediction models to screen the suspected patients with a negative NS1 result. Furthermore, the relative risks between the suspected patients who were predicted to be positive and those who were predicted to be negative exceeded 4.8. Conclusions/SignificanceThese results illustrate that the proposed approach provides an effective and efficient diagnosis procedure to address the global health challenge caused by spread of dengue. Author SummaryThis study has aimed to enhance surveillance of the dengue disease by integrating machine learning technologies with the rapid test kit commonly employed in early diagnosis. In clinical practice, the NS1 rapid test kit is widely employed for early diagnosis. However, real-world studies revealed that a certain percentage of the patients with a negative NS1 test result, ranging from 5% to 39%, were actually infected by dengue. Since early diagnosis is critical for disease control in the early stage of a dengue epidemic, scientists have been working hard to tackle this challenge. Based on this observation, this study was launched to investigate the effects of incorporating machine learning based prediction models to further screen those patients with a negative NS1 test result. The experimental results revealed that the proposed approach was able to identify over 99% of the patients who were infected by the dengue disease. Furthermore, the risk of the suspected patients who were predicted to be positive was 4.8 times higher than the risk of those who were predicted to be negative. The experimental results illustrate that the proposed approach provides an effective and efficient diagnosis procedure to enhance surveillance of the dengue disease.

5

The Quantum Environment in Cryptochrome Enhances Light Absorption of FAD

Wieners, L.; Garcia, M. E.

2026-04-28 biophysics 10.64898/2026.04.24.720615 medRxiv

Top 0.1%

1.7%

Show abstract

The light absorption of the protein cryptochrome and its chromophore FAD is important for the regulation of circadian rhythms and in some species for sensing magnetic fields. To compute the absorption spectrum of chromophore, typically only a small region is treated quantum-mechanically due the high computational cost of spectroscopic calculations. We present a formalism that allows a quantum-mechanical treatment of not only the chromophore but also the neighbouring amino acids which differ from species to species. This is achieved by using the real-time time-dependent Hartree-Fock method. This method allows extending the quantum domain from typically only a few dozen atoms up to around 1,200 atoms for the largest calculations. The presented framework allows the treatment of neighbouring tryptophan residues or the cofactor molecule MTHF in the same calculation and allows to extract information of which regions absorb light depending on wavelength. The presented results also show that the environment around the chromophore FAD amplifies the light absorption in cryptochrome.

6

Protein hydration and druggability

Panasenko, S.; Khorev, V.; Petukhov, M.

2026-07-08 biophysics 10.64898/2026.07.06.736750 medRxiv

Top 0.1%

1.5%

Show abstract

A priori assessment of target proteins' druggability remains an unsolved problem in the field of drug development. The empirical approaches widely used to solve this problem demonstrate low efficiency. In this work, we investigated the factor of hydration of a representative set of 65 evolutionarily and structurally unrelated human enzymes in a water environment. This factor depends only on the structure of the proteins, and not on the physical and chemical properties of any potential ligands. The results show that, unlike the widely used approaches based on calculations of the accessible surface area (ASA), the content of low-entropy water molecules (LEW) in the active sites of human enzymes is systematically higher than that in other areas of their surface, including inactive cavities. Optimal criteria and a step-by-step procedure for identifying protein ligand binding sites are proposed. The proposed approach, based on the calculation of the LEW content in the first hydration layer of potentially interesting target proteins, makes it possible to evaluate their medicinal suitability even before the development of any ligands. The article also presents the results of a comparative analysis of experimental Raman spectroscopy data and the results of molecular dynamics simulations of water hydrogen bonds using three widely used water models (TIP3P, OPC3, and TIP5P) and standard algorithms for calculating hydrogen bond networks.

7

Fidelity-Derived Quantum Dissimilarity-Enhanced k-Nearest Neighbor Algorithm for Arterial Hypertension Prediction

Tampakaki, A. E.; Barmparis, G. D.; Angelaki, E.; Marketou, M. E.; Tsironis, G. P.

2026-06-16 health informatics 10.64898/2026.06.08.26355139 medRxiv

Top 0.1%

1.4%

Show abstract

We present a quantum-enhanced version of the classic k-Nearest Neighbors (kNN) classification algorithm, applied to the prediction of arterial hypertension. The traditional Euclidean distance metric of the kNN algorithm is replaced with a Fidelity-derived quantum dissimilarity measure to evaluate the similarity between data samples. We map classical real-world clinical and ECG-derived data features into quantum states via the Dense-Angle Encoding, which efficiently utilizes parameterized rotation gates to pack multiple features into minimal qubits while maintaining pure states. We evaluate the performance of the dissimilarity measure using both the noiseless state vector Simulator and the IBM Qiskit Estimator primitives. The quantum circuit demonstrates robust predictive capabilities comparable to the classical model. While it does not claim computational supremacy over the classical baseline, the framework proves that fidelity-based similarity is a physically meaningful and efficient approach for hybrid quantum classical classification.

8

A Minimal Stochastic Model of Microbial Ecological Dynamics in a Single-Species-Single-Resource Setting

Leung, C. F. A.; Kolomeisky, A.

2026-07-03 biophysics 10.64898/2026.07.01.735782 medRxiv

Top 0.2%

1.3%

Show abstract

Microbes exhibit complex dynamic behavior as the result of a large number of biochemical processes, spatial and temporal interactions, environmental variations, and evolutionary pressure. Although significant progress has been achieved in understanding microbial ecological dynamics, multiple open questions remain, including the microscopic mechanisms of growth and the roles of nutrients and stochasticity. In this work, we present a minimal theoretical approach to clarify the link between consumption of resources by microbes and their growth. A stochastic model that accounts for a single microbial species consuming a single type of resource while growing via cell division is studied analytically and via Monte Carlo computer simulations. We identify three distinct dynamical regimes of microbial growth determined by the relative magnitudes of resource uptake and division rates and initial conditions. We also show that stochasticity influences the dynamic behavior when the amounts of microbes or resources are low. The model recovers Monod growth kinetics and provides a mechanistic interpretation of the Monod constant and maximal growth rate. The theoretical framework presented captures a wide spectrum of dynamic behaviors in microbial systems, providing a clearer microscopic picture to explain their underlying complex mechanisms.

9

Numerical study of spatial and temporal dynamics of integrin clustering during early cell adhesion

Tsukui, K.; Kawai, T.; Miyoshi, H.; Sakamoto, N.; Wakimura, H.; Ii, S.

2026-06-11 biophysics 10.64898/2026.06.07.730653 medRxiv

Top 0.2%

1.1%

Show abstract

Integrins are adhesion proteins that diffuse along the cell membrane, bind to ligands, and cluster with each other in the early stage of cell adhesion. Integrin clustering and its specific spatial distribution play important roles in subsequent biological processes; however, the mechanisms that give rise to their characteristic spatial distribution remain poorly understood. To address this issue, we developed a cell adhesion model that incorporates cell membrane deformation and integrin dynamics. A hybrid continuous/discrete model was applied to represent membrane deformation, whereas Brownian dynamics combined with a transition state model was used to describe integrin dynamics and binding kinetics. Comparison of numerical simulations of cell adhesion to a substrate with experimental observations at the early stage of adhesion successfully reproduced the characteristic spatial distribution of integrin clusters, in which high-density clusters formed at the periphery of the region adhering to the substrate. These results suggest that the cellular-scale distribution of integrin clusters can be reproduced using only minimal elements, such as adhesion-driven membrane deformation and integrin-ligand binding. In addition, we found that the strength of integrin-ligand binding regulates the degree of clustering by changing the size of the part of the membrane that is deformed, thereby mechanically supporting the mechanical involvement of the actin cytoskeleton in integrin clustering. Furthermore, the formation and spatial distribution of integrin clusters were shown to be determined not only by the static mechanical equilibrium of membrane deformation and physical adsorption, but also by membrane spreading/deformation and the dynamic behavior of integrins. This suggests that the size and spatial distribution of integrin clusters may be controllable by modulating the speed of membrane spreading.

10

Spanning-Tree Thermostatistics of Protein Allostery: An Exact Kirchhoff Framework with Application to Oncogenic KRAS

Senguler Ciftci, F.; Erman, B.

2026-05-01 biophysics 10.64898/2026.04.29.721570 medRxiv

Top 0.2%

1.1%

Show abstract

This study introduces a statistical mechanical framework for allosteric communication in proteins based on the spanning-tree ensemble of residue contact networks. By representing protein structures as weighted graphs, we identify each spanning tree as a topological microstate. The canonical partition function is evaluated exactly via the determinant of the reduced weighted Kirchhoff (Laplacian) matrix, allowing for the derivation of global thermodynamic functions (including Helmholtz free energy, internal energy, entropy, and heat capacity) without approximation. Allosteric channels between specific residue pairs are defined as sub-ensembles containing unique simple paths. Using the Burton-Pemantle theorem and the Moore-Penrose pseudoinverse of the graph Laplacian, we compute exact path probabilities and channel-specific thermodynamics. This methodology enables a decomposition of channel heat capacity into energetic and topological components and quantifies residue-level allosteric importance through fractional contributions to the channel partition function. The framework was applied to the G12D mutation in KRAS, comparing wild-type (PDB: 6GOD) and mutant (PDB: 6GOF) proteins. Results show that while the mutation minimally affects mean internal energy and entropy, it reduces global heat capacity by 27.3%. This indicates a topological stiffening where the mutant occupies a significantly narrower landscape of spanning-tree configurations. At the channel level, the mutation maintains distributional stability across six functional routes but triggers a substantial internal redistribution of allosteric importance. Specific residues, such as Q61 and F156, shift occupancy by up to 35.5%. These findings suggest that the G12D mutation does not destroy communication pathways but reorganizes internal information traffic to favor a catalytically impaired state. This approach provides a rigorous, parameter-free metric for understanding how point mutations perturb distal protein signaling.

11

Probability of Antibiotic Resistance During Treatment in Stochastic PK/PD-Based Bacterial Model with Distinct Drug and Mutation Modes

Izuazu, C.; Browne, C.

2026-06-20 evolutionary biology 10.64898/2026.06.17.732999 medRxiv

Top 0.2%

1.1%

Show abstract

Mathematical models, e.g. differential equations and stochastic processes, have gained considerable attention for understanding evolution of antibiotic resistance. However, most existing models assume standing genetic variation and do not consider the possibility of random or drug-induced mutation of reference bacterial strains. Therefore, we propose a pharmacokinetics/pharmacodynamics (PK/PD)-based continuous-time Markov chain considering the competition and mutation between sensitive and resistant bacterial within an infected host during treatment. The proposed model is approximated as a generalized birth-death process with immigration, allowing for explicit derivation of the probability resistant population establishes during treatment. Besides capturing the stochasticity of de novo emergence of a resistant bacterial strain, we explore the effects of different antibiotic modes of action, horizontal gene transfer, nutrient availability and drug pharmacokinetics on antibiotic resistance. We find that replication-targeting (biostatic) drugs suppress resistance more than death-targeting (biocidal) drugs. Like prior works, we obtain maximized resistance at intermediate drug concentrations, however the consideration of de novo mutation magnifies the superiority of higher doses in preventing resistance emergence.

12

A Pipeline for Solving Edge-Matching Puzzles and Their Implications for Protein Folding

Seifer, S.

2026-05-24 biophysics 10.64898/2026.05.23.727379 medRxiv

Top 0.2%

1.0%

Show abstract

Progress in quantum computation offers new opportunities for addressing longstanding combinatorial challenges. One such challenge is the Eternity II edge-matching puzzle, consisting of 256 tiles, which has resisted solution despite extensive community effort. The computational complexity of this NP-complete problem exceeds the capacity of current quantum annealing processors but lies within reach of hybrid quantum-classical solvers. Testing a quadratic unconstrained binary optimization (QUBO) model of a puzzle on a D-Wave hybrid solver demonstrates a complete solution only for puzzle instances up to 64 tiles. Simulated quantum annealing fails on this benchmark, whereas an original classical heuristic, "nucleation with deduction", succeeds. To approach the full Eternity II puzzle, I developed a MATLAB package that integrates multiple quantum and classical approaches, including neural-network transformers and gradient-based refinement. A multistage computation pipeline is demonstrated successfully on a puzzle comparable in complexity to Eternity II and with a known solution, based on multiple hybrid optimization steps with both "hard" and "soft" constraint formulations, identification of persistent substructures, and a final classical refinement stage. The resulting optimization problem involves [~]100,000 logical variables and requires partial initialization. Intriguingly, solving this puzzle mirrors the "end game" of protein folding, a process that nature completes in mere fractions of a second, seemingly defying expectations set by the Levinthal paradox. The prospect of predicting protein structure by quantum annealing is reviewed in light of these results.

13

Elasto-Osmotic Phase Separation in Confluent Cellular Tissues

Michels, J. J.

2026-06-02 biophysics 10.64898/2026.05.29.727481 medRxiv

Top 0.2%

1.0%

Show abstract

Biomolecular condensates that form via liquid-liquid phase separation (LLPS) of, most prominently, intrinsically disordered proteins (IDPs) are ubiquitous in eukaryotic cells and responsible for regulating a plethora of biological functions. Amongst these, they contribute to regulating cell motility, either individually within an extracellular matrix or collectively within confluent epithelial tissue. In this computational study we focus on the latter with the aim of investigating whether the mutual exertion of mechanical forces during collective migration in an epithelium can principally trigger cytoplasmatic LLPS. Since present models for confluent epithelial motility have so far only considered cells that are devoid of phase separating (protein) solutes, we extend a common multiphase approach for 2D cell motility with a mixing contribution including any number of protein solutes. Our model considers the phase behavior in both intracellular and extracellular regions and determines to what extend the membrane is permeated by the solutes under the influence of mechanical and osmotic forces. Our initial calculations unlock a very rich behavior involving formation and dissolution of condensates during migration, as well as an impact of LLPS on the very nature of the motility itself, through feedback mechanisms which may bear biological relevance.

14

Traveling Wave Analysis of a Go-or-Grow Invasion Model with ECM-Regulated Phenotypic Switching

Sadhu, G.; Jolly, M. K.; Maini, P. K.

2026-04-27 systems biology 10.64898/2026.04.23.720361 medRxiv

Top 0.2%

1.0%

Show abstract

Experimental studies show that tumor cells adopt migratory or proliferative phenotypes depending on the local extracellular matrix (ECM). In this work, we propose a minimal go-or-grow invasion model, comprising two specialist cell phenotypes: proliferating and migratory, with phenotypic switching and cell migration depending on local ECM density. Numerical simulations of this model reveal that the spatial arrangement of proliferative and migratory cells depends on the choice of phenotypic switching function. We then ask whether this specialist cell-phenotype model can be reduced to a generalist cell-phenotype model. We derive a relationship between the reduced model and go-or-grow model in the fast phenotypic switching regime. We observe that the reduced model captures the dynamics of the original model, for a range of realistic phenotypic switching functions. We analytically derive the minimum traveling wave speed of the reduced model in a homogeneous ECM bed. Moreover, using linear stability analysis on the go-or-grow model, we recover the same wave speed expression. In addition, we numerically explore how the key parameters influence the traveling wave speed profile. Our analysis indicated the counter-intuitive result that the wave speed is independent of the matrix degradation rate, and our simulations show that, at most, the speed is weakly dependent on this parameter.

15

A self-consistent model for phase separation and active processes in biomolecular condensates

Di Mambro, M.; De Los Rios, P.

2026-06-02 biophysics 10.64898/2026.06.01.729289 medRxiv

Top 0.2%

0.9%

Show abstract

Biomolecular condensates are thought to play a pivotal role in cellular organization by regulating biochemical reactants in space and time. Sustained molecular fluxes across condensate boundaries, together with the participation of phase-separating molecules in active chemical reactions such as ATP hydrolysis, call for a nonequilibrium description. Here, we propose a self-consistent framework in which diffusion-drift dynamics and chemical reactions are coupled through a conditional free energy, defined as the excess contribution to the chemical potential. Self-consistency is achieved by deriving this quantity from the same free-energy functional that governs molecular interactions and phase separation. We apply the framework to a minimal client-scaffold system and investigate how active chemical processes and phase separation interact at steady state. In doing so, our approach recovers the fundamental rules previously identified for the emergence of nonequilibrium steady-state fluxes. The model shows that active reactions involving the scaffold molecules can regulate the phase behavior of the condensate. Moreover, nonequilibrium steady-state fluxes are maximal near the boundary between the phase-separated and homogeneous regimes, suggesting that condensates sustaining molecular transport may operate close to their stability threshold. In the same region, client fluxes are also enhanced, revealing an indirect coupling between scaffold activity and client transport. These results provide a baseline for developing more detailed theories of chemically active condensates.

16

Time-step restrictions for numerical approximations of the Poisson-Nernst-Planck (PNP) equations

Jaeger, K. H.; Tveito, A.

2026-05-06 biophysics 10.64898/2026.04.30.721819 medRxiv

Top 0.3%

0.9%

Show abstract

The Poisson-Nernst-Planck (PNP) system is an accurate model of electrodiffusion of ionic species. It is commonly used in situations where nanoscale resolution is required, for instance close to ion channels in the membranes of biological cells. The inherent stiffness of the equations has made them challenging to solve and has limited the applicability of the system. In particular, the time step required for stable solutions has typically needed to be very short (nanoseconds), which makes simulations on the time scale of an action potential (milliseconds) difficult. Recently, it has been observed that avoiding operator splitting and instead solving the concentration equations and the electrostatic equation in a coupled manner relaxes the time-step limitation considerably. However, no theoretical explanation of this observation has been provided. Here, we aim to explain why the coupled scheme allows much larger time steps. We illustrate the mechanism by considering special cases that define necessary, but not sufficient, conditions for stability. We also show that these conditions remain relevant for the fully coupled PNP model in 3D.

17

A Beta-Binomial Model for Estimating Zero- or One-inflated Pain Trajectories

Liu, Y.; Harris, R. E.; Clauw, D.; Bayman, E.; Leroux, A.; Lindquist, M. A.

2026-05-11 bioinformatics 10.64898/2026.05.07.721507 medRxiv

Top 0.3%

0.6%

Show abstract

Chronic pain is a widespread public health issue that imposes substantial health, emotional, and economic burdens on individuals and communities. Because pain is subjective and lacks objective biomarkers, it is typically measured using patient-reported scores, often on a numerical scale from zero to ten. Increasingly, pain studies use ecological momentary assessment, with multiple daily assessments over days and across study phases (e.g., a series of baseline and post-intervention assessments). These data frequently show many ratings at the extremes (i.e., at minimum or maximum pain scores), commonly referred to as zero- and one-inflation in the statistical literature, along with considerable within-person variability both within and across days. These phenomena present challenges for statistical analyses, as they violate assumptions of most commonly used statistical techniques (e.g., the normality assumption of linear mixed models). We propose a Bayesian beta-binomial mixed-effects model for modeling potential zero- or one-inflated pain scores while accounting for variability using random effects on the mean and variance parameters across subjects. A simulation study demonstrates that the method accurately estimates model parameters across realistic sample sizes, time points, and zero- and one-inflation levels. An application to data from two longitudinal pain studies demonstrates that the model fits the data better and, when correctly specified, yields accurate uncertainty intervals for longitudinal changes in pain compared to existing models, especially for zero- and one-inflated outcomes. Additionally, the model directly estimates the probability of clinically meaningful pain events. The proposed method provides a powerful statistical framework for studying the patient-reported pain trajectories.

18

From Big Bang to Biochemistry: Entropy-Oriented Mechanics and Information Force Fields as a Unifying Framework for the Origin of Carbon-Based Life

Truong, Q. H. X.; Truong, X. K.

2026-04-24 biophysics 10.64898/2026.04.21.719958 medRxiv

Top 0.3%

0.6%

Show abstract

The emergence of amino acids (AAs) and nucleobases (NBs) across meteorites, interstellar ices, and laboratory shock experiments presents a paradox: why do these specific molecular motifs--a minuscule subset of organic chemistrys combinatorial space--appear repeatedly across diverse environments, in the absence of biological selection? We identify a physical mechanism, prebiotic selection, which biases driven chemical systems toward configurations with high stationary probability p*(x) under sustained entropy flux. The bias is quantified by an information quasi-potential {Phi}I (x) = - ln p*(x), entering the overdamped Langevin dynamics O_FD O_INLINEFIG[Formula 1]C_INLINEFIGM_FD(1)C_FD where {Sigma} is the local entropy production rate (Schnakenberg 1976). {Phi}I is defined self-consistently via the full non-equilibrium stationary density, avoiding the circularity of identifying it with a scalar potential. Two central theorems underlie the framework. Theorem 1 establishes that {nabla}{Sigma} and {nabla}{Phi}I are generically linearly independent off equilibrium, so the dynamics is genuinely two-field. Theorem 2 (structural constraints on single-field gradient dynamics) shows that single-field models on compact manifolds (i) produce yield curves that are at most unimodal under linear driving, and (ii) combine disjoint perturbations additively, giving superlinearity factor S = 1 + O(||{delta} V ||2). The observed superlinear synergy of Ferris et al. (1996) lies far outside this perturbative bound and therefore requires the two-field structure of EOM-IFF; the non-monotonic peak of Blank et al. (2001) is consistent with two-field dynamics and also with single-field dynamics in the unimodal-with-peak case of Theorem 2 part 1, so it does not by itself discriminate. From these results, we: (i) define a formal substrate-minimal criterion for prebiotic selection; (ii) show consistency with the non-monotonic shock-synthesis yield of Blank et al. (2001) (R2 = 0.885, peak at P* = 28.4 {+/-} 1.4 GPa); (iii) show consistency with the superlinear clay-catalysed RNA polymerisation of Ferris et al. (1996) (synergy factor S {approx} 5.75, robust under {+/-}1-nucleotide measurement uncertainty); and (iv) state two further falsifiable predictions awaiting dedicated experimental tests. Every lemma and theorem is accompanied by explicit assumptions, regime of validity, and regime of failure; the frameworks scope is what it claims, not more. Prebiotic selection is identified as a physical process distinct from and prior to biological selection, offering a unified account of chemical convergence in carbon-nitrogen chemistry under sustained entropy flux.

19

Simulation of cell-size systems at long timescales with flexible protein structures

Yunas, K.; Singh, A.; Copeland, M. M.; Tytarenko, A. M.; Kundrotas, P. J.; Halfmann, R.; Kasyanov, P. O.; Feinberg, E. A.; Vakser, I. A.

2026-06-22 biophysics 10.64898/2026.06.20.733545 medRxiv

Top 0.3%

0.6%

Show abstract

Protein behavior inside cells is dominated by the crowded nature of the intracellular environment. Progress in structure determination of proteins and protein complexes, based on advances in Artificial Intelligence, provides an opportunity for structure-based modeling of cellular phenomena. Such modeling at the atomic resolution has been advanced by the traditional simulation techniques, e.g. molecular dynamics. A recently developed docking-based approach implements Markov Chain Monte Carlo sampling of intermolecular energy landscapes, offering several orders of magnitude faster simulation protocols. The approach allows addressing much longer trajectories of macromolecular systems in the crowded intracellular environment at atomic resolution. The sampling by design avoids low-probability (high-energy) states, which greatly accelerates the simulation process. A notable feature of this docking-based approach is the rigid body approximation of protein structures. The rigid-body approximation had been the primary direction in the protein docking field up until recent developments in deep learning. The rigid-body approach should be quite robust for the higher energy transient interactions that dominate the highly crowded cellular environment, as they likely involve relatively small conformational change. However, it is less applicable to the low-energy protein-protein complexes, especially those involving flexible regions. We addressed this problem by incorporating AlphaFold3 top models of the protein complexes in the mapping of the intermolecular energy landscape, as representative of the low-energy configurations of the protein assembly. By the nature of the AlphaFold predictions, these models involve appropriate conformational change between unbound and bound structures. These low-energy docking poses are combined with the rigid-body docking predictions that cover the multiplicity of the transient interactions. Such combination directly addresses the conformational flexibility of proteins upon binding along with the multiplicity of the transient protein encounters in the crowded cellular environment. SIGNIFICANCEProtein behavior inside cells is dominated by the crowded nature of intracellular environment. A recently developed approach allowed addressing long simulation trajectories of macromolecular systems in such environment at atomic resolution. A notable feature of this approach is the rigid body approximation in representation of the protein structures, which had been popular in the field up until the recent developments in artificial intelligence. However, such approximation is less applicable to stable protein-protein complexes, especially those involving flexible regions. We addressed this problem head-on by incorporating top deep learning-generated models of protein complexes. The new approach directly accounts for the flexibility of protein structures upon binding, along with the multiplicity of the transient protein encounters in the crowded cellular environment.

20

Molecular mechanism of water and glycerol transport through hydrophobic selectivity filter in the aquaporin homolog of Trypanosoma brucei

Parsa, P. M.; Sankararamakrishnan, R.

2026-05-01 biophysics 10.64898/2026.04.27.720980 medRxiv

Top 0.4%

0.6%

Show abstract

The protozoan parasite Trypanosoma brucei is implicated in deadly African sleeping sickness. Experimental studies show that T. brucei codes for three aquaporin homologs (TbAQP1 to TbAQP3). TbAQP2 has been established as the high affinity drug transporter of drugs pentamidine and melarsoprol. Mutation in TbAQP2 or its loss result in pentamidine-melarsoprol cross-resistance. TbAQP2 is also shown to transport water, glycerol and other solutes to respond to osmoregulation in the infected hosts or glycerol metabolism. Experimentally determined structures of TbAQP2 shows that it adopts the same aquaporin-like hourglass helical fold. However, the so called aromatic/arginine selectivity filter (Ar/R SF) in TbAQP2 has neither arginine nor aromatic residue and all four residues are hydrophobic. Mutation and functional studies have demonstrated the role of Ar/R SF residues in the transport and selectivity of solutes in aquaporin homologs. The intriguing question is how the completely hydrophobic Ar/R SF region enables the transport of water and glycerol molecules. In this study, we used computational approach to elucidate the molecular mechanism of water and glycerol transport. Our equilibrium molecular dynamics simulations showed that the number of water molecules transported by TbAQP2 is almost one order of magnitude higher than that of prototype water channel AQP1. Moreover, the residence time within TbAQP2 channel is much less compared to that found in AQP1. The relatively wider constriction, interactions of water molecules with the selectivity filter residues and the contact duration, all contribute to a large number of water molecules transported through TbAQP2 channel. Our umbrella sampling studies show that when glycerol is transported through TbAQP2, it participates in interactions with channel residues that can be considered as complimentary to that observed in prototype glycerol transporter GlpF. Our studies reveal the molecular mechanism of water and glycerol transport in TbAQP2 and establish that TbAQP2 is an efficient water transporter. Statement of SignificanceTrypanosoma brucei causes African sleeping sickness and a homolog of aquaporin, TbAQP2, is involved in the transport of drugs that are used to treat this disease. Developing anti-parasitic drugs requires the knowledge of molecular mechanism of the proteins function. TbAQP2 has been shown to transport water and glycerol. Permeating solutes have to pass through a narrow constriction region formed by all hydrophobic residues. In the present study, equilibrium molecular dynamics simulations showed that TbAQP2 transports water molecules faster in large quantity in comparison with mammalian AQP1. Higher water transport is due to relatively wider constriction and minimum water interactions with selectivity filter hydrophobic residues. Permeating glycerol molecule is involved in complementary interactions with the channel residues. Our studies reveal how water and glycerol are transported through hydrophobic selectivity filter in TbAQP2.